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Summary 

Seven  lines  of  experimentation  have  been  carried  out  over  three  years.  In  the 
first  set  of  experiments,  the  PI  has  tested  and  rejected  a  two-process  model  of 
visual  attention  allocation.  He  has  proposed  an  alternative  perceptual  sampling 
model  and  performed  stochastic  simulations  of  the  model  to  show  that  it  can 
account  for  certain  aspects  of  human  performance  in  cued  visual  search  tasks.  In 
the  second  set  of  experiments,  the  PI  has  found  evidence  that  observers  perceive 
occluded  objects  across  time,  a  finding  that  complements  an  analogous  ability  to 
perceptually  complete  partially  occluded  objects  across  space.  Several  lines  of 
experimentation  have  been  carried  out  using  a  bistable  apparent  motion  display  (the 
Ternus  display)  as  a  tool  to  explore  the  assignment  of  object  identity  over  time.  For 
example,  the  PI  has  found  evidence  that  a  common  mechanism  may  underlie  the 
perception  of  bistable  apparent  motion  and  the  capture  of  visual  attention  in  certain 
visual  search  tasks,  and  he  has  discovered  that  perceptual  grouping  by  proximity 
can  precede  the  assignment  of  motion  correspondences  in  bistable  apparent 
motion.  In  the  fifth  project,  the  PI  has  shown  that  visual  salience  is  not  sufficient 
to  produce  attentional  capture;  a  deliberate  state  of  attentional  readiness  is  required 
to  guide  attention  according  to  salient  stimulus  attributes.  In  the  sixth  of 
experiments,  the  PI  is  examining  the  attentional  demands  of  perceptual  grouping 
and  visual  "pop-out."  Finally,  the  PI  has  carried  out  preliminary  experiments 
assessing  the  common  assumption  of  serial  processing  in  visual  search;  the 
observations  seriously  challenge  the  assumption  and  thereby  undermine  many 
currently  influential  theories  of  search. 

Status  of  the  Research 

This  report  describes  the  technical  progress  achieved  in  "Stochastic  Models 
of  Attention  and  Search,"  supported  by  AFOSR  Grant  F49620-92-J-01 86. 

1.  Tests  of  a  two-process  model  of  visual  attention  allocation  (Johnson  &  Yantis, 

in  press) 

In  the  experiments,  subjects  detected  the  presence  or  absence  of  a 
prespecified  target  element  appearing  in  an  array  of  four  elements.  A  spatial  cue 
appeared  200  ms  before  the  onset  of  the  display,  indicating  a  location  in  the 
upcoming  array.  In  separate  blocks  of  trails,  the  cue  was  100%  valid,  50%  valid, 
or  25%  valid  with  respect  to  predicting  the  location  of  the  upcoming  target.  In  the 
100%  valid  condition,  the  cue  was  completely  predictive  of  the  target's  location, 
and  in  the  25%  valid  condition,  it  was  independent  of  the  target's  location.  In  the 
50%  valid  condition,  the  cue  provided  an  intermediate  degree  of  predictive  validity. 
We  were  interested  in  testing  an  attention-allocation  model  in  which  subjects  could 
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allocate  perceptual  samples  to  spatial  locations  according  to  the  information 
provided  by  the  cue.  According  to  the  model,  samples  are  taken  at  a  location  in 
proportion  to  the  probability  that  that  location  contains  the  target.  Evidence  about 
the  identity  of  the  element  in  each  location  accumulates  as  a  random  walk  over 
time,  with  evidence  accumulating  more  rapidly  (i.e.,  a  higher  drift  rate  in  the 
random  walk)  for  locations  from  which  a  larger  number  of  perceptual  samples  have 
been  taken. 

The  predictions  of  this  parallel  model  were  compared  to  those  of  a  serial 
probability-matching  model  (Jonides,  1983).  This  model  holds  that  observers  may 
select  in  advance  of  each  trial  one  location  to  which  they  direct  their  attention,  or 
they  may  divide  their  attention  over  the  entire  display;  subjects'  selection  strategy 
is  assumed  to  vary  from  trial  to  trial  according  to  the  location  and  predictive  validity 
of  the  cue.  If  the  cue  is  1 00%  valid,  observers  always  attend  first  to  the  cued 
location,  and  if  the  cue  is  only  25%  valid,  the  observers  ignore  the  cue.  When  the 
cue  is  50%  valid,  observers  attend  to  the  cued  location  on  50%  of  the  trials  and 
they  distribute  their  attention  evenly  within  the  display  on  the  remaining  trials. 
According  to  the  model,  the  reaction  time  distribution  from  the  50%  valid  condition 
should  be  a  mixture  of  the  distributions  from  the  100%  valid  and  25%  valid 
conditions,  respectively.  Although  the  mean  RT  across  conditions  was  consistent 
with  the  mixture  prediction,  a  more  complete  distributional  analysis  (Yantis,  Meyer, 
&  Smith,  1991)  ruled  out  the  mixture  hypothesis,  thus  undermining  the  viability  of 
the  serial  probability-matching  model.  We  carried  out  additional  control 
experiments  in  which  we  monitored  subjects'  eye  movements  to  ensure  that  the 
effects  we  measured  were  due  to  covert  shifts  of  attention  rather  than  to  overt  eye 
movements. 

We  assessed  the  viability  of  the  parallel  perceptual-sampling  model  by 
running  stochastic  simulations  of  the  model  under  the  conditions  used  in  the 
experiments.  In  the  simulations,  perceptual  samples  were  allocated  to  spatial 
locations  according  to  the  validity  of  the  cue,  and  perceptual  evidence  accumulated 
at  each  location  in  a  random  walk  until  a  criterion  amount  of  information  for  the 
target  was  acquired  (information  can  be  interpreted  as  perceptual  features  of  the 
elements  to  be  identified).  The  simulations  yielded  reaction-time  distributions 
whose  shapes  and  relative  positions  were  compared  to  those  obtained  from  human 
subjects.  The  simulations  suggest  that  the  parallel  perceptual  sampling  model 
provides  a  viable  alternative  to  the  serial  probability-matching  model. 

Our  most  recent  experiments  involved  recording  observers'  eye  position 
during  the  experiment  to  ensure  that  the  effects  we  observed  were  due  to  covert 
shifts  of  attention  and  not  overt  shifts  of  the  eyes. 

2.  Amodal  completion  across  time  and  space  (Yantis,  in  press) 

Another  series  of  experiments  that  we  are  currently  conducting  also  concerns 
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the  possible  role  of  object  representations  in  the  perception  of  apparent  motion  in 
the  Ternus  display.  In  a  well  known  perceptual  phenomenon  known  as  amoda! 
completion,  a  partly  occluded  object  is  perceived  as  complete.  Recent 
investigations  of  completion  suggest  that  it  is  a  truly  perceptual  phenomenon  that 
may  rely  on  some  of  the  same  mechanisms  as  those  responsible  for  illusory 
contours  (e.g.,  Kellman  &  Shipley,  1992;  Sekuler  &  Palmer,  1992).  The  role  of 
amodal  completion  in  perception  is  to  provide  a  coherent  and  continuous 
representation  of  objects  in  the  world  given  a  discontinuous  and  fragmentary  retinal 
image  (due  to  occlusion,  for  example).  All  previous  studies  of  amodal  completion 
have  involved  completion  in  space. 

The  experiments  I  have  carried  out  explore  the  possibility  that  amodal 
completion  may  occur  across  time  as  well  as  across  space.  I  used  a  bistable 
apparent  motion  display  first  described  by  Ternus  (1939/1 926).  There  are  many 
versions  of  the  Ternus  display;  I  will  describe  one  common  one  here.  Two 
elements  (e.g.,  disks)  are  displayed  in  a  subset  of  three  predetermined  locations  in 
the  visual  field  (see  Figure  la).  The  three  locations  are  horizontally  arrayed  about  1 
degree  of  visual  angle  apart.  In  frame  1  of  the  display,  lasting  200  ms,  one 
element  appears  in  the  right  location  and  the  other  object  in  middle  location.  In 
frame  2,  also  200  ms  in  duration,  the  objects  occupy  the  middle  location  and  the 
left  location.  There  is  a  brief  blank  interval  (the  interstimulus  interval  or  ISI) 
between  frames.  Usually  several  such  cycles  are  presented.  The  Ternus  display  is 
perceptually  ambiguous:  it  can  be  perceived  as  involving  either  element  motion  (the 
element  that  begins  on  the  right  hops  end  to  end  over  the  stationary  element  in  the 
middle  location)  or  group  motion  (the  two  elements  move  back  and  forth  together, 
as  a  group,  so  the  identity  of  the  element  in  middle  location  switches  back  and 
forth  from  one  frame  to  the  next).  Figure  2  illustrates  the  motion  correspondences 
under  each  type  of  percept. 

The  probability  that  observers  perceive  group  motion  depends  on  the 
magnitude  of  the  ISI  (see  function  labeled  "No  Occluder"  in  Figure  3):  at  short  ISIs 
(less  than  about  10  ms),  element  motion  is  almost  always  perceived,  and  at  long 
ISIs  (more  than  about  100  ms),  group  motion  is  almost  always  perceived.  The 
probability  of  perceiving  group  motion  increases  monotonically  between  0  and  1 00 
ms  with  the  exact  location  and  slope  of  the  psychometric  function  depending  on 
factors  like  the  size  of  the  objects,  their  spatial  separation,  and  their  duration. 

Observers  viewed  a  standard  Ternus  display  like  the  one  depicted  in  Fig.  la 
on  some  trials,  and  a  Ternus  display  like  the  one  shown  in  Fig.  1b  on  other  trials. 

In  the  latter  displays,  which  are  termed  "occluded  Ternus  displays,"  segments  of 
the  surrounding  rectangle  above  and  below  the  middle  location  are  removed  during 
the  ISI.  This  produces  two  vertically  oriented  illusory  contours  which  support  the 
perception  of  a  "virtual  occluder"  covering  the  middle  location  of  the  Ternus  array. 

The  probability  of  group  motion  reports  as  a  function  of  ISI  for  the  occluded 
Ternus  display  (as  in  Figure  1b)  is  shown  in  Fig.  3  as  the  function  labeled 
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"Occluder";  the  function  labeled  "No  Occluder"  shows  performance  on  trials  in 
which  there  was  no  occluder  (as  in  Figure  la).  The  main  result  is  that  the  location 
of  the  transition  from  element  motion  perception  to  group  motion  perception  is 
centered  at  about  40  ms  for  the  standard  display  and  much  longer  for  the  occluded 
display. 

This  finding  can  be  explained  by  assuming  that  the  occluded  element  was 
amodally  completed  during  the  ISI  when  it  was  occluded  (and  not,  of  course,  when 
it  was  not  occluded).  In  other  words,  the  occluder  provided  an  "explanation"  for 
the  disappearance  of  the  middle  element:  the  element  did  not  disappear,  it  was 
merely  occluded.  Consequently,  the  element  that  was  occluded  still  "persisted" 
during  the  ISI,  and  its  continuity  lead  to  the  perception  of  element  motion  even  at 
very  long  ISIs.  Because  its  phenomenal  persistence  was  prolonged,  when  the 
center  disk  physically  reappeared,  it  was  perceived  as  being  the  same  object  as  the 
one  appearing  in  that  location  during  Frame  1 ,  even  though  its  physical  absence 
was  long  enough  for  any  trace  of  visible  persistence  to  have  faded. 

I  have  carried  out  additional  experiments  using  a  virtual  occluder  presented 
upon  a  semitransparent  ("paint-splattered")  surface  floating  above  the  moving 
elements  using  stereoscopic  viewing  techniques.  The  virtual  occluder  in  these 
experiments  is  much  more  salient  in  this  version  of  the  experiment  and  provides  a 
more  robust  demonstration  of  time-based  amodal  completion. 

3.  Object  continuity  in  visual  attention  and  motion  perception  (Yantis  &  Gibson, 
1994) 


We  have  recently  argued  (see  Yantis  &  Hillstrom,  1 994)  that  the  appearance 
of  a  new  perceptual  objects  in  the  visual  field  appears  to  capture  visual  attention. 
Experiments  from  our  lab  over  the  last  several  years  had  shown  that  the  abrupt 
onset  of  a  visual  element  captures  attention;  Yantis  and  Hillstrom,  using 
equiluminant  visual  elements,  found  that  it  was  not  the  luminance  increment 
accompanying  these  appearances  that  was  responsible  for  attentional  capture,  but 
the  appearance  of  a  new  perceptual  object.  In  the  experiments  described  below, 
we  sought  to  further  explore  the  hypothesis  that  the  visual  system  is  predisposed 
to  attend  to  the  appearance  of  new  perceptual  objects.  The  goal  is  to  link  the 
phenomenon  of  attentional  capture  to  the  perception  of  group  and  element  motion 
in  the  Ternus  (1939/1926)  display. 

One  can  view  the  perception  of  the  Ternus  display  as  reflecting  the 
parameters  that  determine  what  constitutes  a  perceptual  object.  If  the  duration  of 
the  ISI  is  long  enough,  then  the  perceptual  continuity  of  the  object  in  the  middle 
location  can  be  disrupted,  allowing  a  change  in  the  identity  of  that  object,  which  is 
required  for  group  motion  to  be  perceived  (see  Figure  2,  top).  In  contrast,  if  the 
duration  is  quite  short,  then  that  object's  identity  remains  intact,  and  it  can  only  be 
interpreted  as  a  single,  coherent,  continuously-present  perceptual  object,  leading  to 
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the  perception  of  element  motion  (Figure  2,  bottom).  If  the  ISI  at  which  the 
transition  between  element-motion  perception  and  group-motion  perception  in  the 
Ternus  display  represents  the  duration  required  to  disrupt  the  continuity  of  a 
perceptual  object,  then  that  same  interval  might  also  be  sufficient,  when  inserted 
into  an  element  in  visual  search,  to  capture  attention  (recall  that  attention  is 
captured  when  a  new  perceptual  object  appears). 

To  test  this  idea,  we  have  run  several  visual  search  experiments  in  which  one 
element  in  the  visual  display  is  briefly  erased  and  redisplayed  during  the  trial  (Figure 
4).  The  duration  of  the  gap  varies  over  the  same  range  as  the  ISIs  in  the  Ternus 
display.  Attention  is  captured  in  this  experiment  to  the  extent  that  the  duration  of 
the  gap  in  the  target  element  disrupts  the  object  representation  (see  Figure  5, 
closed  symbols  and  right  ordinate):  if  the  duration  of  the  gap  is  very  short,  then  it 
does  not  capture  attention  (attentional  capture  is  indexed  by  the  relative  slope  of 
the  display-size  function:  flat  or  very  shallow  slopes  indicate  complete  attentional 
capture,  steep  slopes  indicate  the  absence  of  attentional  capture).  If  the  duration 
of  the  gap  is  very  long,  then  it  captures  attention  almost  completely  (i.e.,  the  slope 
of  the  visual  search  function  is  near  zero  when  the  target  is  the  element  exhibiting  a 
gap),  and  when  the  duration  of  the  gap  has  an  intermediate  duration,  then  capture 
is  partially  successful.  The  timecourse  of  attentional  capture  is  similar  to  the 
timecourse  of  the  transition  from  element  to  group  motion  perception  in  the  Ternus 
experiment  (Figure  5,  open  symbols,  left  ordinate).  This  result  suggests  that  a 
common  mechanism  may  underlie  attentional  capture  in  my  visual-search  task  and 
the  perception  of  element  and  group  motion  in  the  Ternus  display. 

4.  Perceptual  grouping  and  the  perception  of  bistabie  apparent  motion  (Ternus) 

displays  (Kramer  &  Yantis,  submitted) 

It  has  been  thought  for  a  long  time  that  apparent  motion  perception  is 
insensitive  to  the  form  of  the  objects  rendering  the  motion.  Recent  studies  have 
shown,  however,  that  there  are  small  but  measurable  effects  of  element  identity  on 
disambiguating  apparent  motion  perception.  It  is  less  clear  whether  configurational 
or  grouping  effects  significantly  influence  motion  perception;  available  experimental 
evidence  is  equivocal. 

In  this  series  of  experiments,  we  examined  whether  perceptual  grouping  by 
proximity  could  influence  the  perception  of  element  and  group  motion  in  a  Ternus 
display.  If  figural  properties  of  a  display  are  in  fact  computed  after  a  motion  signal 
is  extracted,  then  one  would  not  effect  perceptual  grouping  to  influence  motion 
perception  in  a  Ternus  display.  We  implemented  the  test  with  displays  like  the  one 
shown  in  Figure  6.  In  each  panel,  two  displays  are  shown,  labeled  "1 "  and  "2"; 
these  are  the  two  frames  of  the  Ternus  display,  with  a  blank  interstimulus  interval 
of  variable  duration  placed  between  them.  Panels  A  and  B  show  conditions  in 
which  the  Ternus  display  is  embedded  in  a  context.  In  Figure  6A,  the  context 
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produces  strong  vertical  grouping,  and  in  Figure  6B,  the  context  produces  strong 
horizontal  grouping.  If  grouping  does  affect  motion  perception  here,  then  we  would 
expect  to  observe  more  element  motion  in  conditions  like  Figure  6A  than  those  like 
Figure  6B,  because  the  context  in  6A  should  serve  to  "anchor"  the  middle  points  in 
place.  Figures  6C  and  6D  represent  control  conditions  without  context,  used  to 
control  for  variations  in  horizontal  separation  in  Figures  6A  and  6B.  If  the  claim 
that  figural  effects  are  computed  after  the  motion  signal  is  extracted  is  correct, 
then  we  would  expect  displays  like  Figure  6A  to  produce  psychometric  functions 
like  those  in  Figure  6C. 

Figure  7  shows  the  results  from  this  experiment.  The  four  functions 
correspond  to  the  four  display  types  shown  in  Figure  6.  There  is  a  small  effect  of 
spacing  (compare  the  top  two  functions),  replicating  previous  findings.  The  most 
dramatic  effect,  however,  is  the  comparison  between  the  vertical  grouping 
condition  in  which  little  group  motion  is  perceived,  with  the  horizontal  grouping 
condition,  in  which  much  more  group  motion  is  reported.  Clearly  there  is  a  strong 
effect  of  grouping  on  bistable  apparent  motion  perception.  The  accompanying 
manuscript  describes  our  results  in  detail. 

5.  Visual  salience  and  attentional  capture  (Yantis  &  Egeth,  1 994) 

When  observers  engage  in  visual  search  for  a  target  defined  by  a  salient 
visual  attribute  (e.g.,  the  target  is  red  and  the  nontargets  are  blue,  or  the  target  is  a 
vertical  bar  and  the  nontargets  are  horizontal  bars),  search  is  typically  extremely 
efficient  (as  indexed,  for  example,  by  the  insensitivity  of  response  time  to  the 
number  of  elements  to  be  searched).  This  visual  "pop-out"  phenomenon  is  widely 
believed  to  reflect  stimulus-driven  or  (bottom-up)  capture  of  visual  attention  by  the 
salient  element.  Indeed,  salience  is  sometimes  defined  as  the  ability  of  an  attribute 
to  capture  attention.  In  virtually  all  studies  that  have  examined  this  question, 
however,  the  salient  attribute  is  explicitly  relevant  in  the  task  set  for  the  observer; 
this  design  feature  makes  it  impossible  to  know  whether  attention  is  captured  by 
the  attribute  in  question,  because  there  is  an  inherent  top-down  or  intentional 
component  to  the  distribution  of  attention  in  these  cases.  In  this  project,  we  have 
shown  clearly  that  salient  visual  attributes  (e.g.,  red  among  blue)  can  be  used  to 
guide  attention  efficiently  when  they  are  known  to  be  relevant  to  the  task,  but 
when  they  are  irrelevant  to  the  observer's  perceptual  goals,  salient  elements  do  not 
capture  attention. 

We  demonstrated  this  simple  result  in  a  visual  search  task  requiring 
observers  to  find  a  vertical  bar  among  slightly  tilted  bars  (a  fairly  difficult  visual 
search  task).  In  the  "color-singleton"  version  of  the  experiment,  three  groups  of 
subjects  participated  in  three  slightly  different  tasks.  The  no-singleton  group  simply 
searched  for  the  vertical  target;  all  stimuli  were  blue.  In  this  condition,  response 
time  increased  linearly  with  display  size  (Figure  8).  The  "target  always  singleton" 
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group  searched  for  a  vertical  target  and  they  were  (correctly)  informed  that  if  the 
target  is  present,  it  would  always  be  the  single  red  bar  among  multiple  blue  bars. 
Response  times  in  this  condition  were  uniformly  fast  and  did  not  depend  on  the 
number  of  elements  in  the  display  (Figure  9),  showing  that  this  salient  attribute  can 
be  used  to  guide  search  efficiently.  This  is  what  is  usually  called  the  "pop-out 
effect."  In  the  "target  10%  singleton"  condition,  a  red  singleton  was  present  on 
every  trial,  but  the  target  was  the  singleton  on  only  1 0%  of  the  trials;  in  other 
words,  the  singleton  was  slightly  /ess  likely  to  be  the  target  than  any  of  the  other 
elements.  If  color  singletons  really  capture  attention,  then  response  times  in  this 
condition  when  the  target  happens  to  be  the  singleton  should  be  just  as  fast  as  in 
the  "target  always  singleton"  condition.  Instead,  response  times  to  singleton 
targets  were  if  anything  slightly  slower  than  to  nonsingleton  targets  (Figure  10). 
This  result  shows  that  when  the  salient  attribute  is  not  part  of  the  observer's 
attentional  set,  it  does  not  have  any  particular  control  over  attention. 

6.  Perceptual  Grouping  and  Attentional  Capture  (Kramer  &  Yantis,  1 995) 

Texture  segregation  and  grouping  have  been  assumed  to  be  closely  related 
processes  by  some  researchers  (e.g..  Beck,  Prazdny,  &  Rosenfeld,  1983;  Treisman 
&  Gormican,  1 988),  while  other  researchers  assume  they  are  qualitatively  different 
(e.g.,  Ben-Av,  Sagi,  &  Braun,  1992).  Some  researchers  assume  texture  segregation 
and  grouping  are  both  preattentive  (Julesz,  1986),  some  assume  that  both  require 
dispersed  attention.  Ben-Av  et  al.  (1992),  on  the  other  hand,  set  out  to  show  that 
an  aggregative  process  like  grouping  requires  attention,  but  that  a  segregative 
process  like  texture  segregation  does  not. 

Ben-Av  et  al.  performed  a  series  of  experiments  in  which  performance  on 
grouping  tasks  and  segregation  tasks  were  compared  under  single  and  dual  task 
conditions.  In  the  dual  task  condition,  subjects  performed  an  attention-demanding 
primary  task  requiring  identification,  along  with  a  grouping  or  segregation  task.  The 
hypothesis  was  that  the  attention  demanding  identification  task  would  significantly 
affect  performance  on  the  grouping  task,  because  this  task  was  assumed  to  require 
attention,  but  that  performance  on  the  segregation  task  would  not  be  impaired, 
because  this  task  does  not  require  attention.  This  is  exactly  what  Ben-Av  et  al. 
(1992)  observed. 

Unfortunately,  claims  that  perceptual  grouping  per  se  is  more  or  less  difficult 
than  texture  segregation  per  se  are  very  difficult  to  sustain,  because  it  is  difficult  to 
equalize  the  difficult  of  the  tasks  used  to  probe  these  two  psychological  processes. 
We  hypothesize  that  by  manipulating  the  strength  of  perceptual  grouping  in  tasks 
similar  to  Ben-Av  et  al's  we  will  find  that  grouping  requires  attention  under  difficult 
grouping  conditions,  but  that  grouping  does  not  require  attention  when  grouping  is 
easy.  Similarly,  by  manipulating  the  difficulty  of  texture  segregation  we  expect  to 
find  that  segregation  does  not  require  attention  when  the  task  is  easy,  but  that  it 
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does  require  attention  when  the  task  is  hard. 

In  our  experiments  we  manipulate  the  strength  of  grouping  by  introducing 
various  degrees  of  rotation  and  jitter  in  the  elements  to  be  grouped,  ranging  from 
no  rotation  or  jitter  (easy  grouping)  to  much  rotation  and  jitter  (hard  grouping).  We 
manipulate  the  difficulty  of  texture  segregation  by  varying  the  similarity  between 
the  target  and  the  background  texture. 

The  results  of  these  experiments  will  reveal  whether  texture  segregation  and 
perceptual  grouping  are  qualitatively  different  by  removing  task  difficulty  as  a 
potentially  confounding  factor. 

7.  On  the  seriality  of  "serial"  search  (Yantis  &  Vaughan,  in  preparation) 

Virtually  all  theories  of  visual  search  assume  that  there  are  certain 
circumstances  in  which  search  proceeds  strictly  serially.  For  example,  a  central 
tenant  of  the  original  Feature  Integration  Theory  (Treisman  &  Gelade,  1980)  was 
that  whenever  a  target  in  visual  search  was  defined  by  a  conjunction  of  two  or 
more  features  that  had  to  be  conjoined,  search  was  necessarily  serial.  The 
evidence  usually  offered  for  seriality  is  the  pattern  of  slopes  in  the  function  relating 
response  time  to  the  number  of  elements  in  the  display  (the  display  size).  Difficult 
visual  search  tasks  often  yield  response  times  that  increase  linearly  with  display 
size;  furthermore,  the  target-present  display-size  function  often  has  a  slope  that  is 
very  close  to  half  that  for  the  target-absent  function.  The  presence  of  a  2:1  slope 
ratio  is  consistent  with  a  serial  self-terminating  (SST)  model  of  visual  search. 
According  to  the  SST  model,  subjects  scan  the  elements  in  the  display  one  at  a 
time  and  make  a  positive  response  as  soon  as  a  target  is  found  (i.e.,  they  terminate 
the  search).  If  no  targets  are  found,  the  display  is  scanned  exhaustively.  If  the 
search  is  random  (as  in  the  case,  for  example,  of  rotated  Ts  in  rotated  Ls),  then  on 
average  (d+  1)/2  of  the  elements  will  have  to  be  scanned  when  the  target  is 
present,  where  d  is  the  display  size.  When  the  target  is  absent,  all  d  of  the 
elements  are  always  scanned. 

As  Townsend  (1974)  pointed  out,  however,  although  increasing  slopes  with 
2:1  slope  ratios  is  consistent  with  an  SST  model,  it  is  also  consistent  with  certain 
limited-capacity  parallel  models.  In  addition,  evidence  subsequently  arose  casting 
doubt  on  this  strong  serial  view.  Pashler  (1987)  found  that  although  display-size 
functions  exhibited  2:1  slope  ratios  for  large  display  sizes  (e.g.,  for  displays  with 
greater  than  8  elements),  the  slope  ratios  were  closer  to  1 :1  for  smaller  display 
sizes.  He  suggested  that  observers  may  search  conjunction  displays  by  "clumps," 
where  search  is  parallel  (but  capacity-limited)  within  a  clump  of  8  elements,  and  but 
serial  between  clumps.  Wolfe,  Cave,  &  Franzel  (1988)  also  found  that  for  certain 
conjunction  searches,  the  display-size  function  had  very  shallow  slopes  that  did  not 
exhibit  2:1  slope  ratios.  So  the  view  that  all  conjunction  searches  are  strictly  serial 
had  to  be  abandoned. 
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Nevertheless,  there  are  certain  classes  of  stimuli  that  are  believed  to  be 
searched  in  a  strictly  serial  fashion,  and  a  form  of  strict  seriality  is  currently  a 
central  assumption  of  Guided  Search  theory  (Wolfe,  1994;  Wolfe  et  al.,  1988). 
Specifically,  in  Guided  Search  it  is  assumed  that  elements  can  be  scanned  only 
serially;  what  differs  among  different  search  tasks  is  the  efficiency  of  the  order  in 
which  the  elements  are  scanned.  In  certain  cases,  elements  that  are  likely  to  be 
targets  based  on  preattentively  available  properties  (e.g.,  local  feature  contrast)  are 
passed  to  the  serial  stage  first,  leading  to  very  efficient  search.  In  other  cases, 
efficient  ordering  of  this  sort  is  not  possible  because  targets  and  nontargets  cannot 
be  distinguished  on  the  basis  of  these  preattentively  available  properties.  In  all 
cases,  however,  search  is  assumed  to  be  strictly  serial. 

More  detailed  diagnostic  procedures  have  been  devised  to  determine  whether 
search  stimuli  are  ever  analyzed  strictly  serially.  Egeth  and  Dagenbach  (1991),  for 
example,  proposed  a  subadditivity  diagnostic  in  which  the  effects  of  visual  quality 
on  one  or  two  elements  in  a  display  are  measured.  They  found  that  their  strict 
criterion  for  seriality  was  satisfied  for  only  one  class  of  stimuli:  rotated  Ts  and  Ls. 
Wolfe  (1  994)  has  called  visual  search  for  a  rotated  T  among  rotated  Ls  the 
"standard  serial  task." 

The  experiments  in  this  project  provide  direct  tests  of  the  SST  model  in  the 
context  of  a  standard  visual  search  task.  The  test  is  based  on  the  RT  mixture 
prediction  made  by  the  SST  model.  Recall  that  when  a  target  is  present,  subjects 
are  assumed  to  scan  elements  in  the  array  one  at  a  time  until  the  target  is  found. 
Let  us  consider  the  case  where  guided  search  is  not  possible:  a  rotated  T  target 
among  rotated  L  nontargets  (Egeth  &  Dagenbach,  1991;  Wolfe  et  al.,  1989).  In 
this  case,  the  elements  are  scanned  in  a  random  order.  For  a  give  display  size  d, 
subjects  will  on  average  find  the  target  after  scanning  one  element  on  Md  of  the 
trials,  after  2  elements  on  Md  of  the  trials,  and  so  forth.  On  Md  of  the  trials  they 
will  be  unlucky  enough  to  find  the  target  only  after  scanning  all  d  of  the  elements. 

If  we  denote  the  response  time  distribution  associated  with  finding  the  target  after 
scanning  j  elements  as  fj(t),  then  the  overall  RT  distribution  when  the  target  is 
present  in  an  unguided  SST  search  is 


d 


f(t)  =  E 

y=i 


where  f(t)  is  the  overall  RT  distribution  and  a  is  a  mixture  parameter  (in  this  case  Oy 
=  Md'i  j).  In  other  words,  the  overall  RT  distribution  is  a  mixture  of  the 
distributions  associated  with  scanning/ elements  (where  /  =  1  to  oO.  Mixture 
distributions  have  several  interesting  and  distinctive  properties  that  are  amenable  to 
statistical  test  (see  Yantis,  Meyer,  &  Smith,  1991,  for  a  comprehensive  review). 
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For  example,  the  overall  mean  RT  /jf  is  also  mixture  of  the  individual  means 
associated  with  scanning  j  elements:  /Jf  =  1  cXj  fj/ .  Furthermore,  the  variance  is  a 
weighted  mixture  of  the  individual  variances  plus  an  inflation  factor  that  depends  on 
the  spread  of  the  means:  a/  =  Z  dj  +  I  ay  (  //y  -  /jf  )^.  The  mixture 
distribution's  variance  will  be  quite  large  relative  to  the  variance  of  any  of  the 
individual  distributions  to  the  extent  that  the  mean  RT  for  the  smallest  and  largest 
display  sizes  are  far  apart  (as  they  usually  are  in  experiments  of  this  sort).  The 
lower  (fast)  tail  of  the  mixture  distribution  must  overlap  completely  with  the  lower 
tail  of  the  fastest  distribution,  f,(t),  and  the  upper  (slow)  tail  must  overlap  with  the 
upper  tail  of  the  slowest  distribution,  fj(t).  Yantis  et  al.  (1991 )  describe  several 
methods  for  testing  hypotheses  involving  RT  mixture  distributions,  including  the 
previously-described  Multinomial  Maximum-Likelihood  Mixture  (MMLM)  analysis 
that  provides  direct  tests  of  particular  mixture  hypotheses  and  that,  unlike  most 
other  techniques,  allow  for  statistical  inference  by  providing  estimates  of  the 
probability  of  Type  I  and  Type  II  errors. 

We  have  carried  out  several  experiments  with  different  ranges  of  display 
sizes  to  test  the  mixture  prediction  of  the  SST  models.  In  all  cases  we  have  found 
that  the  variance  in  the  target-present  conditions  is  too  small  to  be  accounted  for 
by  the  mixture  prediction;  this  seriously  challenges  the  viability  of  the  SST  model. 
We  have  not  yet  completed  our  experiments,  but  the  preliminary  results  are  robust 
and  convincing.  The  main  work  that  remains  is  to  provide  an  alternative  account  of 
the  data  that  is  consistent  with  the  observed  means  and  variances. 
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